AITopics

2605.0548

Country: Europe > Italy (0.28)

Genre: Research Report (0.50)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

arXiv.org Machine LearningMay-20-2026

Factor Augmented High-Dimensional SGD

Li, Shubo, Han, Yuefeng, Yu, Xiufan

Stochastic gradient descent (SGD) has been a cornerstone of machine learning since the pioneering work of Robbins & Monro (1951). Beyond its algorithmic simplicity and scalability, SGD has also become a central object of theoretical study, with refined analyses linking its dynamics to implicit regularization, generalization performance, and algorithmic stability. For decades, theoretical analyses of SGD have largely resided within the realm of classical stochastic approximation (Polyak & Juditsky, 1992; Lai, 2003; Bottou et al., 2018), where the data dimension is considered fixed while the sample size tends to infinity. While this regime has yielded foundational insights, it no longer fully reflects the characteristics of modern learning systems. Contemporary applications often operate in regimes where data dimension, sample size, and model complexity grow together, calling for new theoretical tools and perspectives that go beyond traditional asymptotic analyses. In this study, we focus on the learning tasks involving high-dimensional predictors. When SGD is applied directly to such data, the dimensionality of the feature space propagates into the optimization process, resulting in a highdimensional (HD) parameter space. Algorithmically, one trending strategy is to approximate the gradient updates using a low-rank representation to reduce memory costs and accelerate computation (Wang et al., 2018; Vogels et al., 2019; Kozak et al., 2019; Kasiviswanathan, 2021; Zhao et al., 2024). Theoretically, despite the vast literature on SGD, convergence guarantees of HD-SGD remain limited (Garrigos & Gower, 2023; Li et al., 2025).

artificial intelligence, factor model, machine learning, (16 more...)

2605.19291

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.88)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.90)

Opponents Artist David KittDavid Kitt (born 1975 in Dublin Ireland) isan Irishmusician.

machine learning, natural language, section 4, (14 more...)

Country:

Europe > Ireland > Leinster > County Dublin > Dublin (0.24)
North America > Canada > Ontario > Toronto (0.05)
North America > United States > New York (0.04)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.40)

Neural Information Processing SystemsFeb-8-2026, 16:16:33 GMT

63d5fb54a858dd033fe90e6e4a74b0f0-AuthorFeedback.pdf

csg tree, opération, remark 1, (14 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Neural Information Processing SystemsFeb-7-2026, 21:45:11 GMT

294e09f267683c7ddc6cc5134a7e68a8-AuthorFeedback.pdf

coda data, experiment, subspace, (13 more...)

Technology: Information Technology > Artificial Intelligence (0.31)

Hosseini, Bamdad, Huang, Ziqi

Error Analysis of Bayesian Inverse Problems with Generative Priors

arXiv.org Machine LearningJan-27-2026

Data-driven methods for the solution of inverse problems have become widely popular in recent years thanks to the rise of machine learning techniques. A popular approach concerns the training of a generative model on additional data to learn a bespoke prior for the problem at hand. In this article we present an analysis for such problems by presenting quantitative error bounds for minimum Wasserstein-2 generative models for the prior. We show that under some assumptions, the error in the posterior due to the generative prior will inherit the same rate as the prior with respect to the Wasserstein-1 distance. We further present numerical experiments that verify that aspects of our error analysis manifests in some benchmarks followed by an elliptic PDE inverse problem where a generative prior is used to model a non-stationary field.

artificial intelligence, inverse problem, machine learning, (19 more...)

2601.17374

Country: North America > United States (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

arXiv.org Machine LearningDec-5-2025

Concentration bounds for intrinsic dimension estimation using Gaussian kernels

Andersson, Martin

We prove finite-sample concentration and anti-concentration bounds for dimension estimation using Gaussian kernel sums. Our bounds provide explicit dependence on sample size, bandwidth, and local geometric and distributional parameters, characterizing precisely how regularity conditions govern statistical performance. We also propose a bandwidth selection heuristic using derivative information, which shows promise in numerical experiments.

dimension, kernel, lemma 3, (13 more...)

2512.04861

Country: Europe > Sweden > Uppsala County > Uppsala (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Neural Information Processing SystemsOct-3-2025, 02:12:53 GMT

we will extend the submission with discussions from below. 2

We thank the reviewers for their insightful comments. In this rebuttal, we respond to remarks from reviews. Remark 1 The work lacks discussion about the comparison of interpretability with BSP-Net. Moreover, their CSG structure is fixed by definition. CSG trees for different instances (see Figure on the right). Remark 2 Only a single instance of CSG visualization for each class is shown.

artificial intelligence, csg tree, machine learning, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)